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Abstract 

Background: In chronic hepatitis C virus (HCV) infection, liver tissue pathology and HCV genotype are important 
determinants of clinical and/or treatment-related outcome. Although consistent epidemiological and/or 
molecular-biological clues derived from different studies on single virus-host interactions are meanwhile published, 
the in vivo transcriptional responses and cellular pathways affected in >1 key aspects of the disease or treatment 
process are far from being understood. 

Methods: Microarray analysis was performed in peripheral whole blood (PB) samples from 36 therapy-naive 
HCV-infected patients with known liver histology. Linear regression analysis identified gene expression profiles 
significantly correlating (P < 0.01 5) with >1 out of 7 variables: sustained viral response (SVR), viral non-response (NR), 
end of treatment viral response (ETR), viral breakthrough (VB), HCV genotype (Gt. 1 vs. Gt. 2/3), stage of hepatic 
fibrosis [St. 0/1 vs. St. 2/3/4] and grade of hepatic inflammation (Gr. 0/1 vs. Gr. 2/3/4). Correlation values across all 
seven contrasts were considered for hierarchical clustering (HCL). 

Results: A total of 1,697 genes showed >1 significant correlation results and genes involved in cell differentiation 
(183), immune response (53), and apoptosis (170) were leading fractions. HCL grouped the genes into six major 
clusters. Functional annotation analysis using DAVID (http://david.abcc.ncifcrf.gov) revealed that expression profiles 
that best linked these variables were highly enriched in cytokine/chemokine activity (Fisher-exact P< 0.0001) and 
specific biological module-centric algorithms finally led our focus on four out of fifty-three immune response genes: 
SMAD family member 3 (SMAD3), interleukin 1 receptor accessory protein (IL1RAP), tumor necrosis factor receptor 
superfamily member 1A (TNFRSF1 A), and chemokine 'C-C motif receptor 5 (CCR5). Of those, TNFRSF1 A and CCR5 
showed significant correlation with two out of seven variables based on microarray and/or quantitative real-time 
polymerase chain reaction (qRT-PCR) data. 

Conclusion: We identified molecular targets of the innate and adaptive immune system and validated their 
transcriptional specificity in vivo suggesting significant involvement in two unique outcomes during HCV treatment. 
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Background 

Hepatitis C virus (HCV) is estimated to persistently infect 
about 170 million people worldwide [1]. Histological grade 
of hepatic inflammation and stage of hepatic fibrosis as well 
as HCV genotype (Gt.) are considered significant determi- 
nants of clinical outcome in terms of progression of liver 
disease, and/or response to treatment in patients with 
chronic HCV infection (cHCV) as: (1) persistent he-patic 
inflammation is considered a major determinant of liver 
fibrosis progression towards cirrhosis [2]; (2) previous re- 
search has detected an association between advanced 
hepatic fibrosis and lower rates of successful response (sus- 
tained viral response (SVR)) to Interferon-alpha [3] or pegy- 
lated Interferon-alpha and Ribavirin (Peg-IFN-a/RBV) [4]; 
and (3) patients infected with Gt2 or Gt.3 (80%) regularly 
achieve higher SVR rates relative to those infected with 
Gtl or Gt.4 (50%) [5]. To date, molecular targets/pathways 
significantly involved in multiple disease and/or treatment 
response determining host interactions with HCV have not 
been systematically studied or characterized. In this study, 
we analyzed gene expression profiles in peripheral whole 
blood (PB) of therapy-naive HCV-infected subjects that 
underwent liver biopsy in order to identify gene expression 
patterns reflective of host biology that is linked to >1 clin- 
ical and/or viral stratification clusters. To this end, rigorous 
multicontrast linear regression analysis was performed on 
gene expression profiles against seven variables reflecting 
the state of cHCV with or without therapy. Our final results 
demonstrated that tumor necrosis factor receptor super- 
family member 1A (TNFRSF1A) and chemokine 'C-C 
motif receptor 5 (CCR5) correlated significantly with two 
out of seven variables. These findings, which have also been 
successfully validated by quantitative real-time polymerase 
chain reaction (qRT-PCR) in a fraction of patients, repre- 
sent a promising basis for future molecular research aimed 
to provide comprehensive insights into genes/pathways 
with multiple in vivo impacts in viral hepatitis and/or de- 
velop anti-HCV therapies that target these genes/pathways. 

Methods 

Study subjects 

Therapy-naive Caucasian patients with cHCV (# = 36) 
received Peg-IFN-a2a (Pegasys® /Roche: 180 (ig/week) or 
Peg-IFN-a2b (PEG-Intron®/Schering-Plough: 1.5 [ig/kg/ 
week) and Ribavirin (RBV, Rebetol®/Schering-Plough, 
1000-1200 mg/d) for 48 weeks (Gtl) or 24 weeks (Gt.3) 
and were followed up for 24 weeks post treatment at the 
University Hospital in Essen, Germany. All donors signed 
informed consents approved by the local board for ethics. 

Liver biopsy 

Liver biopsies were obtained from all 36 patients prior to 
enrollment as part of their standard clinical assessments. 
All 36 specimens were evaluated using the Batts-Ludwig 



Grading (BL-G) and Staging (BL-S) system for viral hepa- 
titis [6]. 

Laboratory studies 

Quantitative HCV RNA was performed 12 h before 
administration of the first dose of Peg-IFN-a and later 
during each study visit. Liver chemistry and safety labora- 
tory tests were performed 12 h prior to treatment initi- 
ation and during each study visit. 

Microarray analysis 

RNA (PAXgene-Blood-RNA-Kit, Qiagen, Hilden, Germany) 
isolated from PB was collected 12 h before the first 
injection of Peg-IFN-a directly into PAXgene-Blood-RNA- 
Tubes (Qiagen, Hilden, Germany). Total RNA, cRNA syn- 
thesis, and labeling along with hybridization to the 
U133A_2 or U133A arrays (Asymetrix, St Clara, CA, USA) 
were performed according to the manufacturers protocol. 
Gene expression values (log2) were determined using a cor- 
rection for probe GC content, RMA background subtrac- 
tion, and Quantile normalization. Chip-type-related batch 
effect was removed by regression analysis (PARTEK Gen- 
omics Suite). 

Thirty-six patients were assigned to seven unique categor- 
ies based on the criteria shown in Table 1: therapy outcome 
(SVR; NR, viral non-response; VB, viral breakthrough; ETR, 
viral relapse or end of treatment viral response), HCV geno- 
type (Gt. 1 vs. Gt. 2/3), histological stage of hepatic fibrosis 
(St. 0/1 vs. St. 2/3/4), and histological grade of hepatic in- 
flammation (Gr. 0/1 vs. Gr. 2/3/4). In a first step, gene ex- 
pression profiles in PB were subjected to linear regression 
analysis against the abovementioned seven categorical vari- 
ables. Then, expression profiles of genes that passed the strict 
statistical cutoff of P< 0.015 in at least one contrast were 
selected and further subjected to unsupervised hierarchical 

Table 1 Distribution of patients across seven categorical 
disease /therapy-related outcomes 



Symbol Categorical title Count (n) 



SVR 


Sustained viral response 


16/36 


ETR 


End of treatment viral response 


7/36 


VB 


Viral breakthrough 


5/36 


NR 


Viral non-response 


8/36 


Gt. 2,3 


HCV genotype 2 or 3 


6/36 


Fibrosis St. 0,1 


Histological stage of hepatic 
Fibrosis 0 or 1 

(Batts-Ludwig classification) 


21/36 


Inflammation Gr. 0,1 


Histological grade of hepatic 
Inflammation 0 or 1 

(Batts-Ludwig classification) 


23/36 



Table 1 gives the numerical distribution of patients (n = 36), whose samples 
were subjected to microarray analysis at baseline, among seven groups 
representing categorical variables as defined under the Methods section. 
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clustering (HCL) based on calculated linear regression values 
across all seven contrasts. Furthermore, functional annota- 
tion analysis using DAVID [7] was performed to identify bio- 
logical activity that these expression profiles were most 
highly enriched in. Specific biological module-centric algo- 
rithms finally led our focus on four genes significantly 
involved in 'immune response'. In a second step, we selected 
two out of those four genes by filtering the respective micro- 
array data based on which expression profiles had reached 
statistical significance in at least one additional contrast. 
These results were validated by PCR in a fraction of patients 
(n = 9), of which RNA samples were currently available. 

Details on gene selection algorithms and clustering along 
with statistical degree and magnitude of associations are 
provided under the Results section. Correlation heat maps 
as well as clustering of microarray data have been performed 
using PARTEK Genomics Suite. 

qRT-PCR 

Real-time detection of target gene messenger RNAs 
(mRNAs) with one-step quantitative PCR was performed 
on the Rotor-Gene 2000 real-time amplification system 
(Corbett Research). One-step qRT-PCR was carried out with 
the QuantiTect SYBR Green RT-PCR Kit (Qiagen) accord- 
ing to the manufacturer s instructions. For the mRNA popu- 
lation per 1 gene, copy numbers were normalized to the 
number of beta-actin (p-actin) transcripts. Furthermore, the 



TNFRSF1A and CCR5 gene transcripts were quantified by 
qRT-PCR using specific QuantiTect Primer Assays 
(Qiagen). 

Statistical analysis 

Linear regression analysis was performed using one-way 
ANOVA. Statistics and Graphics were performed in PAR- 
TEK Genomics Suite. The significance cutoff was chosen 
not to exceed 5% (P<0.05). As shown elsewhere [8], for 
analysis checking two contrasts at once we relaxed P to <0.1 
only for one contrast in order not to risk missing significant 
biology and obtain a slightly larger gene set for validation 
qRT-PCR. 

Results 

In this study, we analyzed expression patterns of genes sig- 
nificantly correlating with one or more HCV-related para- 
meters, liver disease outcomes and/or responses to anti- 
HCV treatment. More specifically, gene expression profiles 
in PB of therapy-naive patients with cHCV were subjected 
to linear regression analysis against seven categorical vari- 
ables shown in Table 1. Gene expression profiles (n = 1,697) 
that reached at least one highly significant correlation result 
(r > 0.4 or r < -0.4, P < 0.015) were selected and further sub- 
jected to unsupervised hierarchical clustering (HCL) using 
linear regression values calculated across all seven contrasts 
(Figure 1). HCL grouped the genes into six major clusters 
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providing a detailed view of the relationship between gene 
expression profiles, clinic- or therapy-related outcomes and 
viral characteristics. Genes involved in immune response 

= 53), cell differentiation (# = 183), and apoptosis 
(n = 170) were leading fractions across these clusters based 
on functional annotation analysis using DAVID [7]. Expres- 
sion profiles that best linked these parameters were highly 
enriched in cytokine/chemokine activity' (DAVID [7], Fisher 
exact P< 0.0001). 

Then, we focused on four out of the fifty-three genes 
involved in 'immune response' through specific biological 
module-centric algorithms provided by DAVID [7], that is 
SMAD family member 3 (SMAD3), interleukin 1 receptor 
accessory protein (IL1RAP), TNFRSF1A, and CCR5. 
Among those, CCR5 gene expression (microarray) showed 
a strong positive correlation with SVR (r = 0.29, P = 0.08) 
and an inverse correlation with VB (r = -0.38, P = 0.02, 
Additional file 1: Table SI) at baseline. TNFRSF1A gene ex- 
pression (microarray) exhibited positive correlation with 
SVR (r = 0.41, P = 0.01) and an inverse correlation with NR 
(r=-0.35, P = 0.03, Additional file 1: Table SI) at baseline. 
In line with these findings were the results from the exact 
same type of analysis including gene expression data gener- 
ated by qRT-PCR. Thus, CCR5 gene expression (qRT-PCR) 
showed a positive correlation with SVR (r = 0.78, P = 0.01) 
and an inverse correlation with VB (r = -0.58, 
P = 0.09, Additional file 2: Table S2) at baseline. 
TNFRSF1A gene expression (qRT-PCR) exhibited a 
positive correlation with SVR (r = 0.64, P = 0.06) and 
an inverse correlation with NR (r=-0.57, P = 0.1, 
Additional file 2: Table S2) at baseline. Since only a 
small count of samples were available for qRT-PCR 
(n = 9), caution should be exerted when interpreting 
the P; therefore, mainly correlation values should be 
used as comparable measures between microarray 
and qRT-PCR results. 

Knowing this, we should also note that qRT-PCR data 
often showed a lower P relative to microarray data due 
to the higher correlation values calculated by one-way 
ANOVA in all contrasts. However, results obtained for 
TNFRSF1A and CCR5 on the level of both microarray 
and qRT-PCR analyses are statistically tight. Conse- 
quently, our approach considering these genes important 
players for multiple in vivo outcomes (despite some 
associations with the abovementioned variables at a 
relaxed P of <0.1 [8] in different settings, i.e. microarray 
vs. qRT-PCR) likely identified active and meaningful bio- 
logical relationships. Results for SMAD3 reached signifi- 
cance in no contrast based on qRT-PCR expression 
values and showed predominantly inconsistent trends 
relative to microarray data (Additional file 1: Tables SI 
and Additional file 2: Table S2). Interestingly, IL1RAP 
showed only on the level of qRT-PCR data analysis a 
positive correlation with SVR (r = 0.75, P = 0.01). 



Taken together, this study identified molecular mar- 
kers/pathways of the innate and adaptive immune sys- 
tem that potentially exert multifaceted in vivo roles in 
cHCV as indicated by statistical correlations between 
two unique disease-related outcomes and gene expres- 
sion profiles detected by microarray analysis and vali- 
dated by qRT-PCR. 

Discussion 

The present study was aimed to identify human genes/ 
pathways participating in multiple host interactions with 
HCV in vivo. The analytical design chosen provided a 
unique opportunity to use the microarray-technique as a 
molecular readout for the differential regulation of 
genes/pathways with diverse potential implications in 
cHCV. Thus, pattern analysis of microarray data based 
on gene expression significantly correlating with one or 
more viral characteristics, histological liver pathologies, 
and/or final treatment responses led to identification of 
specific gene clusters strongly associated with the cyto- 
kine/chemokine activity in vivo. These observations ap- 
pear biologically consistent with previous research 
linking liver disease and treatment outcome during 
cHCV to responses of both the innate and adaptive im- 
mune system orchestrated by networks of cytokines and 
chemokines [9,10]. Particularly, various cytokines and 
chemokines are reportedly involved in affecting host sus- 
ceptibility to developing chronic hepatitis C or response 
of cHCV to IFN-a therapy [11,12]. 

Using microarray analysis, combined with functional an- 
notation tools such as DAVID [7], we detected specific 
changes in gene expression of four molecular markers 
assigned to the functional annotation category 'immune 
response; that is SMAD3, IL1RAP, TNFRSF1A, and 
CCR5, which exhibited significant correlation with two 
categorical variables representing unique therapy out- 
comes. Validation qRT-PCR confirmed these results for 
TNFRSF1A and CCR5. In contrast, linear regression ana- 
lysis using qRT-PCR data for SMAD3 did not detect any 
significance; therefore, SMAD3 has been omitted from the 
Discussion section. As specific interactions between HCV 
and SMAD3 have been previously identified but are, how- 
ever, beyond this articles scope, interested readers are re- 
ferred for details to the respective original literature [13]. 

Moreover, increased TNFRSF1A gene expression at 
baseline was found in significant correlation with HCV 
elimination in terms of SVR and inverse relation to HCV 
persistence due to complete viral non-response (NR, that 
is non-ETR or non-VB). These observations are in agree- 
ment with previous publications showing up-regulation of 
TNFRSF1A expression in peripheral dendritic cells of 
HCV- infected patients who reach SVR [14]. 

In recent years, increasing amounts of evidence have 
indicated that spontaneous HCV clearance or successful 
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response to antiviral treatment display a predominant T 
helper 1 (Thl) lymphocyte-related cytokine/chemokine 
profile including CCL5 and its specific receptor CCR5 
[15]. In this light, the positive correlation between CCR5 
gene expression levels and SVR as well as the inverse 
correlation between CCR5 gene expression and VB, 
which is the most immediate type of treatment failure, 
both observed in the present study, strongly suggest the 
good prognostic value along with the potentially favor- 
able biological role of increased CCR5 expression in the 
peripheral blood at baseline. 

There are several limitations of this study that warrant 
comment. First, patients in relative low numbers failed 
to clear HCV due to ETR (n = 7) or VB (n = 5), which 
may have affected the statistical power of the linear re- 
gression model and introduced errors in calculation of 
significance in ETR- and/or VB-based contrasts. Second, 
this study was able to provide confirmation of significant 
correlations against validated results via qRT-PCR in 
only 25% (9/36) of patients whose samples were origin- 
ally subjected to microarray analysis. Third, protein ex- 
pression has not been investigated for identified gene 
targets; thus, posttranscriptional expression and true 
level of functional importance of these genes remains 
obscure. Future work will address these concerns in on- 
going clinical trials with larger numbers of HCV- 
infected subjects treated with Peg-IFN-a/RBV and/or 
new direct-acting antiviral agents. 

Conclusions 

In conclusion, this study provides foundation for greater 
understanding of the multifaceted role of the immune 
system in the outcome of chronic hepatitis C by associ- 
ating patient stratification based on standard clinical 
variables with biologically significant genes/pathways 
likely involved in numerous key aspects of the disease 
and treatment process in vivo. 

Additional files 



Additional file 1: Table SI. Final linear regression analysis results based 
on microarray data at baseline [16]. 

Additional file 2: Table S2. Final linear regression analysis results based 
on qRT-PCR data at baseline [16], 
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ETR: End of treatment response or viral relapse (HCV-RNA not detectable at 
the end of antiviral treatment with a subsequent increase in serum HCV RNA 
levels before the end of follow-up); NR: Viral non-response (presence of HCV 
RNA in serum at the end of a minimum of 24 weeks (for Gt. 2/3) or 
48 weeks (for Gt. 1/4) of therapy; SVR: Sustained viral response (HCV-RNA not 
detectable at the end of follow-up, that is week 24 after completion of 
antiviral treatment); VB: Viral breakthrough (an abrupt increase in serum HCV 
RNA levels after a period of persistent suppression before completion of 
antiviral treatment). 
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